Complete stability analysis of a heuristic approximate dynamic programming control design

Similar resources

Complete stability analysis of a heuristic approximate dynamic programming control design

This paper provides new stability results for Action-Dependent Heuristic Dynamic Programming (ADHDP), using a control algorithm that iteratively improves an internal model of the external world in the autonomous system based on its continuous interaction with the environment. We extend previous results for ADHDP control to the case of general multi-layer neural networks with deep learning acros...

Complete stability analysis of a heuristic ADP control design

This paper provides new stability results for Action-Dependent Heuristic Dynamic Programming (ADHDP), using a control algorithm that iteratively improves an internal model of the external world in the autonomous system based on its continuous interaction with the environment. We extend previous results for ADHDP control to the case of general multi-layer neural networks with deep learning across...

Stability analysis of heuristic dynamic programming algorithm for nonlinear systems

In this paper, a value-iteration-based heuristic dynamic programming (HDP) algorithm is developed to solve the optimal control problem for continuous-time affine nonlinear systems. First, a rigorous convergence proof of the HDP algorithm is given. Second, stability issues of the HDP algorithm for nonlinear systems are investigated. It is commonly believed that the main drawback of the HDP algorithm...

Reinforcement Control via Heuristic Dynamic Programming

Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic, which is a powerful form of reinforcement control [1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. Unlike supervised learning, adaptive critic design does not require that the desired control signals be known. Instead, fee...

Approximate Dynamic Programming for Ship Course Control

Dynamic programming (DP) is a useful tool for solving many control problems, but because of its computational complexity, traditional DP control algorithms are often unsatisfactory in practice. A new method is therefore needed that retains the advantages of DP while being computationally simpler. In this paper, an approximate dynamic programming (ADP) based controller system has been used to solve a ship he...

Journal

Journal title: Automatica

Year: 2015

ISSN: 0005-1098

DOI: 10.1016/j.automatica.2015.06.001